ABSTRACT

Starting with the premise that the construct-oriented
approach is the only viable approach to personality assessment, this
paper considers five propositions. First, a prerequisite to
generalizable and valid psychometric measurement of personality rests
on the choice of broad-based constructs with systematic univocal
definitions. Next, measures will be likely to demonstrate convergent
and discriminant validity and relative freedom from desirability bias
if, and only if, these requirements are incorporated into test
construction. Third, untrained item writers produce more valid
personality scales than do empirical procedures using extant pools.
Fourth, convergent and discriminant validity of personality measures
is demonstrable even in the face of confounded method variance, and
last, a potentially fruitful avenue for further investigation is the
demonstration of a number of reproducible sets of psychopathological
types. Evidence supporting each of the above propositions is
discussed. Some areas of research needing attention are outlined.
(Author)

Discriminantly Valid Personality Measures:
Some Propositions

Douglas N. Jackson

University of Western Ontario

In 1957--18 years ago--at an APA convention I gave a paper--co-authored
by Messick--entitled Content and Style in Personality Assessment.
This paper was interpreted as advancing the hypothesis that the major
response dimensions of some widely used personality tests were identifiable
as response styles. But the paper carried another message, one that is
even more important for personality assessment than that of response
styles. It is the idea that personality assessment involves the appraisal
of content, referring "to response consistencies in certain defined
assessment situations which reflect a particular set of broader behavioral
tendencies, relatively enduring over time, having as [its] basis some
unitary trait, need state, attitudinal or belief disposition, or
psychopathological syndrome" (Jackson & Messick, 1962). The suggestion that
what is to be measured in personality assessment should reflect content, so
defined, ought not to be very revolutionary, were it not for a tradition
of radical empiricism which left most personality assessment specialists
unable and/or unwilling to admit to an interest in measuring content and
markedly suspicious of anyone else's claim of having done so. The state
of knowledge in personality and psychopathology, we were told, was not so
advanced as to permit unequivocal identification of the relationship between
responses to item content and the underlying dynamics of trait dimensions.
Furthermore, when investigators such as Campbell and Fiske (1959) sought to
evaluate the degree to which putative trait measures were generalizable
across methods of measurement, the results were almost uniformly
discouraging. It is little wonder that many psychologists despaired of the
possibilities for valid and socially useful measurement in personality.

But the emergence of the thinking behind the notion of construct
validity and the idea, championed by Loevinger (1957), that there should be a
theoretically defensible relationship between item content and the
underlying construct, provided an important alternative to the radical
empiricism implicit in building personality scales out of items found to
discriminate particular criterion groups. What would happen if personality
scales were developed on the basis of a frankly rational approach which
created an item pool by writing items so that they spanned all identifiable
facets of an explicit definition of a trait, including various likely
manifestations and evoking situations? In 1960 I began a program of
personality scale research and construction in which I have had the
opportunity to review more than 15 thousand items. Much in the way of
cumulative knowledge about personality assessment has emerged from this
experience. One of the strongest impressions to emerge from it concerns the
potency of human judgment of item content, and the importance of basing
personality assessment upon well-thought-out constructs. Thus, my first
proposition is that

     A prerequisite to generalizable and valid psychometric
     measurement of personality dimensions rests on the choice
     of broad-based constructs with systematic univocal
     definitions.



Notice that I did not equivocate and say 'one approach to valid measurement
is....' Rather, I have implied that this is the only viable approach. The
alternative of, for example, the use of narrow, situation-specific constructs
is at best inefficient, in that it will require a very large number of
scales for situation-trait facets, and at worst will fail entirely because
very little in the way of systematic theory is available about how situations
should be categorized and how precisely they may interact with traits.

There are times when one may with impunity move from one level of
abstraction to another. For example, Jackson, Hourany and Vidmar (1972)
undertook a conceptual analysis of the construct of Risk Taking. They
hypothesized four facets: Monetary, Physical, Social, and Ethical Risk
Taking. They undertook to measure each facet with five different methods
of measurement. Factor analytic treatment of the data revealed four distinct
factors, representing the four facets of risk taking. The pattern of factor
loadings was virtually completely consistent with expectations regarding the
convergent and discriminant validity of the four facets. However, when a
second-order factor analysis was undertaken, clear evidence for a general
Risk Taking factor was uncovered, accounting for 80 per cent of the
first-order factor variance. In the case of Risk Taking, one could do
reasonably well predicting behavior relevant to sub-species of Risk Taking
by using a general measure.
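
The idea of a general factor "accounting for" a share of first-order
factor variance can be illustrated with a minimal sketch. It assumes,
and this is an assumption for illustration rather than the procedure
used in the study, that the share is approximated by the largest
eigenvalue of the first-order factor intercorrelation matrix divided by
the number of factors. The correlation values and the function names
are hypothetical.

```python
def largest_eigenvalue(matrix, iterations=500):
    """Dominant eigenvalue by power iteration.

    Assumes a symmetric matrix with non-negative entries, such as a
    matrix of positively correlated first-order factors.
    """
    size = len(matrix)
    vector = [1.0] * size
    eigenvalue = 0.0
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(size))
                   for i in range(size)]
        eigenvalue = max(product)
        vector = [value / eigenvalue for value in product]
    return eigenvalue


def general_factor_share(factor_correlations):
    """Approximate share of first-order factor variance on one general factor."""
    return largest_eigenvalue(factor_correlations) / len(factor_correlations)


# Hypothetical intercorrelations among four first-order factors
# (illustrative values only, not data from the Risk Taking study).
strongly_related = [
    [1.0, 0.7, 0.7, 0.7],
    [0.7, 1.0, 0.7, 0.7],
    [0.7, 0.7, 1.0, 0.7],
    [0.7, 0.7, 0.7, 1.0],
]
print(round(general_factor_share(strongly_related), 3))
```

When the first-order factors are nearly uncorrelated, the same ratio
falls toward one divided by the number of factors, which is the pattern
one would expect when no single general dimension is present.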


The situation is different for Achievement, long regarded as a unitary
dimension. Jackson, Ahmed, and Heapy (1976) undertook a conceptual analysis
of Achievement and hypothesized six distinct facets: Status with Experts,
Acquisitiveness, Achievement via Independence, Status with Peers,
Competitiveness, and Concern for Excellence. Five methods of measuring
each of these facets were devised and administered to a large group of
subjects. Again factor analytic results revealed a pattern of factor
loadings highly consistent with convergent and discriminant properties
for each of these six facets. This indicates that the achievement construct
as described by McClelland and others is actually separable into distinct
components, each of which can be isolated in psychological measures. Even
when a further factor analysis was undertaken of the first-order factor
intercorrelation matrix, three factors emerged, indicating that even at a
higher level of abstraction there was no confirmation of a single dimension
of Achievement.

Results such as those reported indicating evidence for convergent
and discriminant validity are typical of many similar findings obtained
using personality inventories that we have developed. In contrast to the
experience of many other investigators, our relative success is, I submit,
the result of our approach to test construction. Our suggestions for
constructing personality assessment devices are fairly simple, as suggested
by the second proposition:

     Measures will be likely to demonstrate convergent and
     discriminant validity and relative freedom from desirability
     bias if, and only if, these requirements are incorporated
     into test construction.

To construct a personality assessment device, one need only select
an appropriate dimension, think about it in terms of its manifestations and
in terms of its relation to other constructs, carefully define it, and
prepare an item pool. Items should be balanced in terms of likely
manifestations of the trait, and in terms of negative and positive instances.
Even if one is preparing only a single scale, it is better to consider items
in relation to a multi-scale battery. This is so because it forces one to
consider not only the item's substantive link to its own scale, but its
differentiation from distinct scales. Items should have a strong conceptual
link to only one scale.
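
One practical consequence of balancing positive and negative instances
can be sketched as follows. The sketch assumes true-false items and a
scale with equal numbers of true-keyed and false-keyed items; the
function names and the eight-item key are hypothetical, chosen only for
illustration.

```python
def score_scale(responses, keyed_true):
    """Count responses given in the keyed direction.

    responses  -- list of bools, True when the respondent answered "true"
    keyed_true -- list of bools, True when "true" indicates the trait
    """
    return sum(1 for answer, key in zip(responses, keyed_true) if answer == key)


def is_balanced(keyed_true):
    """True when half of the items are keyed true and half keyed false."""
    return 2 * sum(keyed_true) == len(keyed_true)


# Hypothetical eight-item scale: four positive and four negative instances.
keys = [True, False, True, False, True, False, True, False]
yea_sayer = [True] * len(keys)  # answers "true" to every item
print(is_balanced(keys), score_scale(yea_sayer, keys))
```

On a balanced key, a respondent who answers "true" to everything lands
at the midpoint of the scale rather than at its top, so a tendency to
agree does not by itself masquerade as the trait.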

We have been taught to be suspicious of such heavy reliance on
rationality. Is it likely that a psychologist can simply look at an item
and determine what scale it is keyed on? Our experience is illustrative.
An analysis was undertaken (Jackson, 1971) of the item pool comprising the
Personality Research Form, and in particular of the degree to which an item
written for a particular scale correlated highly with an irrelevant scale.
In the entire set of over 900 items, only five failed in this respect,
although, to be sure, more were found wanting in other respects. The
percentage of hits using this criterion was 99.9. This is strong evidence,
I think, that substantive analysis alone can contribute much to the
discriminant properties of scales.

The recommended procedure is to prepare a matrix of correlations
between each item and each of a substantial number of irrelevant scales,
including a desirability scale. When an item correlates too highly with an
irrelevant scale, discard the item. Actually this procedure can be refined
by partialling desirability variance out of the variance associated with
both item and total scale score, and through the use of orthogonal factor
scores rather than raw scale scores. We have also developed algorithms to
reduce scale intercorrelations.
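
The screening step just described can be sketched in a few lines. This
is a minimal illustration under stated assumptions, not the algorithm
actually used: the cutoff of .30 for "too highly" is hypothetical, the
function names are invented for the example, and desirability is
removed with an ordinary first-order partial correlation.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length score lists.

    Assumes neither list is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """Correlation of x and y with z partialled out of both.

    Assumes neither x nor y is perfectly correlated with z.
    """
    rxy, rxz, ryz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (rxy - rxz * ryz) / math.sqrt((1 - rxz ** 2) * (1 - ryz ** 2))


def keep_item(item, irrelevant_scales, desirability, cutoff=0.30):
    """Return False when the item correlates too highly with any
    irrelevant scale after desirability is partialled out.

    The cutoff is a hypothetical value chosen for illustration.
    """
    for scale in irrelevant_scales:
        if abs(partial_corr(item, scale, desirability)) > cutoff:
            return False
    return True
```

Here each argument is a list of scores over the same respondents: one
item, the total scores on each irrelevant scale, and the desirability
scale. An item that passes is retained for its own scale; one that
fails is discarded.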



In general what I am suggesting is that if items are selected to be
associated with only their own scale, if scales are constituted so they will
have only minimum mutual redundancy, if items bear an important substantive
relation to a scale definition, if response biases are suppressed in scale
construction, a likely result is the development of scales showing worthwhile
levels of convergent and discriminant validity. I have with my collaborators
completed about ten studies of convergent and discriminant validity involving
a number of different sets of scales--all but one yielded confirmatory
evidence. The single exception was when we attempted to have prison guards
judge the personalities of their wards. They seemed to be validly sensitive
to only one dimension--that of Hostility.

The approach I am advocating places a heavy reliance on judgment. We
have relied on the judgment of those relatively experienced in personality
and psychological theory. The question arises as to whether or not this is
a necessary precondition for obtaining the sort of results that I have
reported. I think not. In fact, judging from the overall history of
personality scale construction, psychological training might even be a
disadvantage. I believe that persons of hardly more than average
sophistication in psychology can draw valid inferences about personality
items. Let me give you an example of a multidimensional scaling analysis
based upon two independent groups of Canadian Forces officer candidates. We
asked these officer candidates to judge the mutual relations between persons,
between items, and between persons and items, for example, by asking them to
judge the probability that a certain person, whose description was provided,
would respond true to a given item. When our subjects were split randomly
into two groups, and entirely independent multidimensional scaling analyses
were conducted on each sample, it was found that all items and persons showed
highest projections on the appropriate hypothesized dimension of Dominance,
Autonomy, or Impulsivity. These scale values for items and persons replicated
across samples substantially, as indicated by correlations of .99, .99, and
.98, respectively.

Now, let's turn to Proposition 3.

     Untrained item writers produce more valid personality
     scales than do empirical procedures using extant pools.

In 1971 (Jackson, 1971) I issued a challenge to investigators to undertake
such a comparison. Ashton and Goldberg (1973) accepted the challenge and
uncovered considerable support for the proposition above. My own study
(Jackson, 1975) evaluated it further. What I did was to select three
personality scales--Tolerance, Sociability, and Self Esteem--and assign one
of them randomly to each of 23 students in an undergraduate course with
instructions to write 16 items relevant to a definition given. We
administered these student-constructed scales together with similar scales
drawn from the Jackson Personality Inventory and the California Psychological
Inventory. The latter device is considered by some psychologists to be the
finest example of empirical scale construction. Validity was evaluated by
computing correlations between each of the experimental and comparison
scales and self- and peer-rating criteria based on 116 females comprising
roommate pairs. Data revealed that the magnitude of the average validities
of student-constructed scales was about three times higher than those
derived from the CPI.

I would like to suggest that the basis on which relatively novice
persons can accurately judge and write items is by using a shared trait
inferential network. This is conceptualized as an n-dimensional Euclidean
space in which traits are organized in terms of their implicative relations,
which in turn are a function of their actual co-occurrence. Recent evidence
for such a shared network was developed by Reed and Jackson (1975), who
demonstrated that groups of judges showed correlations in excess of .98 in
ascribing personality scale responses to certain psychopathological types,
descriptions of which they had been given. Furthermore, there was strong
evidence that their judgments were accurate in reflecting actual responses.

Once relatively homogeneous scales of personality or psychopathology
have been identified, the question arises as to whether or not one can
identify clusters of persons showing unique patterns of high and low scores.
Our experience from a number of analyses is that such unique profiles are
identifiable and can be replicated. Let me give an illustration. Skinner,
Jackson, and Hoffmann (1974) administered the Differential Personality
Inventory to psychiatrically-hospitalized alcoholics and decomposed the data
matrix in such a way that similarities in profile shape were identified. The
criterion for identifying a common modal profile type was that it be
replicated across three sub-samples. Eight such bipolar modal profile types
were found, each quite different from the average profile for all alcoholics.
Further studies have revealed that the proportion of deviant types varies in
different normal and pathological samples. A fruitful avenue for
investigation would be the use of types so isolated in studies of prognosis
and the differential effects of treatment.

The Future of Personality Assessment

What does the future hold for personality assessment? Rather than
prognosticate regarding the possible course of developments in personality
assessment, let me simply list a few of the areas in which I believe research
is urgently needed.

First, in regard to the study of situations, how they influence
behavior and interact with traits, and for that matter, how persons
influence situations, we are in a vast sea of ignorance. Although we have
been reminded frequently of the importance of situational determinants at
least since the 1935 Murchison Handbook of Social Psychology, and although
hard data are frequently promised, they are rarely delivered! Rarely have
there been attempts to classify and measure situational contexts paralleling
the many attempts at identifying the important dimensions of personality.
It is, of course, very difficult and expensive to devise life-like situational
contexts for assessment purposes. One promising approach is the use of
verbally elicited responses to materials having the demonstrated tendency
to elicit a certain class of behavior. It is possible to treat large numbers
of such items in the same analytical and mathematical ways in which we treat
personality items, providing a basis for the emergence of valid
generalizations.

Secondly, the methods by which we validate personality assessment
devices need further investigation. In a sense, a multitrait-multimethod
matrix can be looked upon as an evaluation of the degree to which personality
traits are generalizable over situations. But how much is known about the
typical set of criterion measures? The usual procedure is to spend several
years carefully devising a personality questionnaire, but only a few minutes
with our criteria. Of course, one way to obtain a handle on the identification
of uncontaminated trait variance is to seek to understand the sources of
method variance which inevitably contaminate our criteria.
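
For readers who want to see how a multitrait-multimethod matrix is read,
here is a minimal sketch. It summarizes the matrix by two averages: the
mean correlation between different methods measuring the same trait
(convergent) and the mean correlation between measures of different
traits (discriminant). The data layout, the function name, and the
correlation values are hypothetical; a fuller Campbell and Fiske
analysis would also separate same-method from different-method
heterotrait values.

```python
from itertools import combinations


def mtmm_summary(measures, correlations):
    """Return (mean convergent r, mean discriminant r).

    measures     -- list of (trait, method) labels
    correlations -- dict mapping frozenset({label_a, label_b}) to r
    """
    convergent, discriminant = [], []
    for first, second in combinations(measures, 2):
        r = correlations[frozenset((first, second))]
        if first[0] == second[0] and first[1] != second[1]:
            convergent.append(r)      # same trait, different method
        elif first[0] != second[0]:
            discriminant.append(r)    # different traits
    return (sum(convergent) / len(convergent),
            sum(discriminant) / len(discriminant))


# Hypothetical two-trait, two-method example (illustrative values only).
measures = [("Dominance", "self"), ("Dominance", "peer"),
            ("Autonomy", "self"), ("Autonomy", "peer")]
correlations = {
    frozenset((measures[0], measures[1])): 0.6,
    frozenset((measures[2], measures[3])): 0.5,
    frozenset((measures[0], measures[2])): 0.2,
    frozenset((measures[1], measures[3])): 0.3,
    frozenset((measures[0], measures[3])): 0.1,
    frozenset((measures[1], measures[2])): 0.0,
}
print(mtmm_summary(measures, correlations))
```

Evidence for convergent and discriminant validity requires the first
average to be clearly larger than the second. Method variance shows up
when measures of different traits that share a method correlate more
highly than those that do not.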

Third, we should seek new techniques for measuring personality, apart
from the printed questionnaire. I am referring to techniques which are
structured and possibly even contrived but which do not require a self
report. The Inter-Nation Simulation of Guetzkow and Cherryholmes (1965), in
which individuals play the role of government leaders in prescribed
situations, strikes me as promising for allowing the emergence of personal
qualities. Possibly other kinds of similar simulations in which a computer
terminal is an integral part may emerge.


About one thing I am relatively certain: Personality assessment has
outgrown its dependence on ad hoc and theoretically vacuous techniques such
as empirical scale construction with respect to external criteria. It is
important to know something psychologically about what we are measuring! I
for one have confidence that we shall continue to make progress measuring
psychologically important variables.